(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1522 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ACAGTCAGCC GCATGGCTCC CCTGTGCCCC AGCCCCTGGC TCCCTCTGTT GATCCCGGCC 60 

CCTGCTCCAG GCCTCACTGT GCAACTGCTG CTGTCACTGC TGCTTCTGAT GCCTGTCCAT 120

CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA 180

GATGACCCAC TGGGCGAGGA GGATCTGCCC AGTGAAGAGG ATTCACCCAG AGAGGAGGAT 240

CCACCCGGAG AGGAGGATCT ACCTGGAGAG GAGGATCTAC CTGGAGAGGA GGATCTACCT 300

GAAGTTAAGC CTAAATCAGA AGAAGAGGGC TCCCTGAAGT TAGAGGATCT ACCTACTGTT 360

GAGGCTCCTG GAGATCCTCA AGAACCCCAG AATAATGCCC ACAGGGACAA AGAAGGGGAT 420

GACCAGAGTC ATTGGCGCTA TGGAGGCGAC CCGCCCTGGC CCCGGGTGTC CCCAGCCTGC 480

GCGGGCCGCT TCCAGTCCCC GGTGGATATC CGCCCCCAGC TCGCCGCCTT CTGCCCGGCC 540

CTGCGCCCCC TGGAACTCCT GGGCTTCCAG CTCCCGCCGC TCCCAGAACT GCGCCTGCGC 600

AACAATGGCC ACAGTGTGCA ACTGACCCTG CCTCCTGGGC TAGAGATGGC TCTGGGTCCC 660

GGGCGGGAGT ACCGGGCTCT GCAGCTGCAT CTGCACTGGG GGGCTGCAGG TCGTCCGGGC 720

TCGGAGCACA CTGTGGAAGG CCACCGTTTC CCTGCCGAGA TCCACGTGGT TCACCTCAGC 780

ACCGCCTTTG CCAGAGTTGA CGAGGCCTTG GGGCGCCCGG GAGGCCTGGC CGTGTTGGCC 840

GCCTTTCTGG AGGAGGGCCC GGAAGAAAAC AGTGCCTATG AGCAGTTGCT GTCTCGCTTG 900

GAAGAAATCG CTGAGGAAGG CTCAGAGACT CAGGTCCCAG GACTGGACAT ATCTGCACTC 960

CTGCCCTCTG ACTTCAGCCG CTACTTCCAA TATGAGGGGT CTCTGACTAC ACCGCCCTGT 1020

GCCCAGGGTG TCATCTGGAC TGTGTTTAAC CAGACAGTGA TGCTGAGTGC TAAGCAGCTC 1080

CACACCCTCT CTGACACCCT GTGGGGACCT GGTGACTCTC GGCTACAGCT GAACTTCCGA 1140

GCGACGCAGC CTTTGAATGG GCGAGTGATT GAGGCCTCCT TCCCTGCTGG AGTGGACAGC 1200

AGTCCTCGGG CTGCTGAGCC AGTCCAGCTG AATTCCTGCC TGGCTGCTGG TGACATCCTA 1260

GCCCTGGTTT TTGGCCTCCT TTTTGCTGTC ACCAGCGTCG CGTTCCTTGT GCAGATGAGA 1320

AGGCAGCACA GAAGGGGAAC CAAAGGGGGT GTGAGCTACC GCCCAGCAGA GGTAGCCGAG 1380

ACTGGAGCCT AGAGGCTGGA TCTTGGAGAA TGTGAGAAGC CAGCCAGAGG CATCTGAGGG 1440

GGAGCCGGTA ACTGTCCTGT CCTGCTCATT ATGCCACTTC CTTTTAACTG CCAAGAAATT 1500

TTTTAAAATA AATATTTATA AT 1522
(2) INFORMATION FOR SEQ ID NO: 2: 




(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 459 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(A) DESCRIPTION: First 37 amino acids represent
signal peptide, and remaining amino acids
represent mature protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu Ile Pro Ala
-35 -30 -25

Pro Ala Pro Gly Leu Thr Val Gln Leu Leu Leu Ser Leu Leu Leu Leu
-20 -15 -10

Met Pro Val His Pro Gln Arg Leu Pro Arg Met Gln Glu Asp Ser Pro
-5 1 5 10

Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp
15 20 25

Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu
30 35 40

Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro
45 50 55

Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp
60 65 70 75

Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gln Glu Pro Gln Asn Asn
80 85 90

Ala His Arg Asp Lys Glu Gly Asp Asp Gln Ser His Trp Arg Tyr Gly
95 100 105

Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe
110 115 120

Gln Ser Pro Val Asp Ile Arg Pro Gln Leu Ala Ala Phe Cys Pro Ala
125 130 135

Leu Arg Pro Leu Glu Leu Leu Gly Phe Gln Leu Pro Pro Leu Pro Glu
140 145 150 155

Leu Arg Leu Arg Asn Asn Gly His Ser Val Gln Leu Thr Leu Pro Pro
160 165 170

Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gln
175 180 185

Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr
190 195 200

Val Glu Gly His Arg Phe Pro Ala Glu Ile His Val Val His Leu Ser
205 210 215

Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu
220 225 230 235

Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala
240 245 250

Tyr Glu Gln Leu Leu Ser Arg Leu Glu Glu Ile Ala Glu Glu Gly Ser
255 260 265

Glu Thr Gln Val Pro Gly Leu Asp Ile Ser Ala Leu Leu Pro Ser Asp
270 275 280

Phe Ser Arg Tyr Phe Gln Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys
285 290 295

Ala Gln Gly Val Ile Trp Thr Val Phe Asn Gln Thr Val Met Leu Ser
300 305 310 315

Ala Lys Gln Leu His Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp
320 325 330

Ser Arg Leu Gln Leu Asn Phe Arg Ala Thr Gln Pro Leu Asn Gly Arg
335 340 345

Val Ile Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala
350 355 360

Ala Glu Pro Val Gln Leu Asn Ser Cys Leu Ala Ala Gly Asp Ile Leu
365 370 375

Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala Phe Leu
380 385 390 395

Val Gln Met Arg Arg Gln His Arg Arg Gly Thr Lys Gly Gly Val Ser
400 405 410

Tyr Arg Pro Ala Glu Val Ala Glu Thr Gly Ala
415 420



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CGCCCAGTGG GTCATCTTCC CCAGAAGAG
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: YES

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GGAATCCTCC TGCATCCGG 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

GGATCCTGTT GACTCGTGAC CTTACCCCCA ACCCTGTGCT CTCTGAAACA TGAGCTGTGT 60 

CCACTCAGGG TTAAATGGAT TAAGGGCGGT GCAAGATGTG CTTTGTTAAA CAGATGCTTG 12 0 

AAGGCAGCAT GCTCGTTAAG AGTCATCACC AATCCCTAAT CTCAAGTAAT CAGGGACACA 180 

AACACTGCGG AAGGCCGCAG GGTCCTCTtGC CTAGGAAAAC CAGAGACCTT TGTTCACTTG 2 40 

TTTATCTGAC CTTCCCTCCA CTATTGTCOA TGACCCTGCC AAATCCCCCT CTGTGAGAAA 3 00 

CACCCAAGAA TTATCAATAA AAAAATAAATl TTAAAAAAAA AATACAAAAA AAAAAAAAAA 3 60 

AAAAAAAAAA GACTTACGAA TAGTTATTGA VaAATGAATA GCTATTGGTA AAGCCAAGTA 420 

AATGATCATA TTCAAAACCA GAC GGC CAT C ATCACAGCTC AAGTCTACCT GATTTGATCT 4 80 

CTTTATCATT GTCATTCTTT GGATTCACTA GATTAGTCAT CATCCTCAAA ATTCTCCCCC 54 0 

AAGTTCTAAT TACGTTCCAA ACATTTAGGG GTTACATGAA GCTTGAACCT ACTACCTTCT 6 00 

TTGCTTTTGA GCCATGAGTT GTAGGAATGA TGAGTTTACA CCTTACATGC TGGGGATTAA 660 

TTTAAACTTT ACCTCTAAGT CAGTTGGGTA GCCTTTSGGCT TATTTTTGTA GCTAATTTTG 72 0 

TAGTTAATGG ATGC AC TGTG AATCTTGCTA TGATAGTTTT CCTCCACACT TTGCCACTAG 780 

GGGTAGGTAG GTACTCAGTT TTCAGTAATT GCTTACCTJ^A GACCCTAAGC CCTATTTCTC 84 0 

TTGTACTGGC CTTTATCTGT AATATGGGCA TATTTAATAC AATATAATTT TTGGAGTTTT 900 

TTTGTTTGTT TGTTTGTTTG TTTTTTTGAG ACGGAGTCTtX GCATCTGTCA TGCCCAGGCT 9 60 

GGAGTAGCAG TGGTGC CAT C TCGGCTCACT GCAAGCTCCA GCTCCCGAGT TCACGCCATT 102 0 

TTCCTGCCTC AGCCTCCCGA GTAGCTGGGA CTACAGGCGC CGGCCACCAT GCCCGGCTAA 10 8 0 




TTTTTTGTAT TTTTGOTAGA GACGGGGTTT CACCGTGTTA GCCAGAATGG TCTCGATCTC 114 0 

CTGACTTCGT GATCCACC^G CCTCGGCCTC CCAAAGTTCT GGGATTACAG GTGTGAGCCA 12 0 0 

CCGCACCTGG CCAATTTTTtXgAGTCTTTTA AAGTAAAAAT ATGTCTTGTA AGCTGGTAAC 126 0 

TATGGTACAT TTCCTTTTAT TAATGTGGTG CTGACGGTCA TATAGGTTCT TTTGAGTTTG 132 0 

GCATGCATAT GCTACTTTTT GCAGTCCTTT CATTACATTT TTCTCTCTTC ATTTGAAGAG 138 0 

CATGTTATAT CTTTTAGCTT CAC'ETGGCTT AAAAGGTTCT CTCATTAGCC TAACACAGTG 144 0 

TCATTGTTGG TACCACTTGG ATCATAAGTG GAAAAACAGT CAAGAAATTG CACAGTAATA 150 0 

CTTGTTTGTA AGAGGGATGA TTCAGGTGAA TCTGACACTA AGAAACTCCC CTACCTGAGG 156 0 

TCTGAGATTC CTCTGACATT GCTGTATAffA GGCTTTTCCT TTGACAGCCT GTGACTGCGG 162 0 

ACTATTTTTC TTAAGCAAGA TATGCTAAAS TTTTGTGAGC CTTTTTCCAG AGAGAGGTCT 168 0 

CATATCTGCA T CAAGTGAGA ACATATAATG VrCTGCATGTT TCCATATTTC AGGAATGTTT 174 0 

GCTTGTGTTT TATGCTTTTA TATAGACAGG gVaACTTGTT CCTCAGTGAC CCAAAAGAGG 180 0 

TGGGAATTGT TATTGGATAT CATCATTGGC CCACGCTTTC TGACCTTGGA AACAATTAAG 186 0 

GGTTCATAAT CTCAATTCTG TCAGAATTGG TACA&GAAAT AGCTGCTATG TTTCTTGACA 192 0 

TTCCACTTGG TAGGAAATAA GAATGTGAAA CTCTTEAGTT GGTGTGTGTC CCTNGTTTTT 198 0 

TTGCAATTTC CTTCTTACTG TGTTAAAAAA AAGTATQATC TTGCTCTGAG AGGTGAGGCA 2 04 0 

TTCTTAATCA TGATCTTTAA AGATCAATAA TATAATCOTT TCAAGGATTA TGTCTTTATT 210 0 

ATAATAAAGA TAATTTGTCT TTAACAGAAT CAATAATATA ATCCCTTAAA GGATTATATC 216 0 

TTTGCTGGGC GCAGTGGCTC ACACCTGTAA TCCCAGCACT\ TTGGGTGGCC AAGGTGGAAG 222 0 

GATCAAATTT GCCTACTTCT ATATTATCTT CTAAAGCAGA ATTCATCTCT CTTCCCTCAA 22 80 

TATGAT GAT A TTGACAGGGT TTGCCCTCAC TCACTAGATT GTGAGCTCCT GCTCAGGGCA 2 34 0 

GGTAGCGTTT TTTGTTTTTG TTTTTGTTTT TCTTTTTTGA GAG|AGGGTCT TGCTCTGTCA 24 0 0 

CCCAGGCCAG AGTGCAATGG TACAGTCTCA GCTCACTGCA GCCTCAACCG CCTCGGCTCA 246 0 



AACCATCATC CCATTTCAGC CTCCTGAGTA GCTGGGACTA CAGGCACATG CCATTACACC 2 52 0 

TGGCTAATTT TTTTGTATTT CTAGTAGAGA CAGGGTTTGG CCATGTTGCC CGGGCTGGTC 2 5 80 

TCGAACTCCT GGACTCAAGC AATCCACCCA CCTCAGCCTC CCAAAATGAG GGACCGTGTC 2 640 

TTATTCATTT CCATGTCCCT AGTCCATAGC CCAGTGCTGG ACCTATGGTA GTACTAAATA 2 7 00 

AATATTTGTT GAATGCAATAV GTAAATAGCA TTTCAGGGAG CAAGAACTAG ATTAACAAAG 2 7 60 

GTGGTAAAAG GTTTGGAGAA AAAAATAATA GTTTAATTTG GCTAGAGTAT GAGGGAGAGT 2 820 

AGTAGGAGAC AAGATGGAAA GGTCTCTTGG GCAAGGTTTT GAAGGAAGTT GGAAGTCAGA 2 8 80 

AGTACACAAT GTGCATATCG TGGCAGGCAG TGGGGAGCCA ATGAAGGCTT TTGAGCAGGA 2 940 

GAGTAATGTG T T GAAAAAT A AATaVaGGTT AAACCTATCA GAGCCCCTCT GACACATACA 3 000 

CTTGCTTTTC ATTCAAGCTC AAGTTTGTCT CCCACATACC CATTACTTAA CTCACCCTCG 3 0 60 

GGCTCCCCTA GCAGCCTGCC CTACCTCTTT ACCTGCTTCC TGGTGGAGTC AGGGATGTAT 3120

ACATGAGCTG CTTTCCCTCT CAGCCAGAGG ACATGGGGGG CCCCAGCTCC CCTGCCTTTC 3180 

CCCTTCTGTG CCTGGAGCTG GGAAGCAGGC CAGGGTTAGC TGAGGCTGGC TGGCAAGCAG 3 24 0 

CTGGGTGGTG CCAGGGAGAG CCTGCATAGT GCCAGGTGGT GCCTTGGGTT CCAAGCTAGT 3300

CCATGGCCCC GATAACCTTC TGCCTGTGCA GACACCTGCC CCTCACTCCA CCCCCATCCT 3 3 60 

AGCTTTGGTA TGGGGGAGAG GGCACAGGGC CAGACAAACC TGTGAGACTT TGGCTCCATC 3 42 0 

TCTGCAAAAG GGCGCTCTGT GAGTCAGCCT GCTCCCCTCC AGGCTTGCTC CTCCCCCACC 3480

CAGCTCTCGT TTCCAATGCA CGTACAGCCC GTACACACCG TGTGCTGGGA CACCCCACAG 3 54 0 

TCAGCCGCAT GGCTCCCCTG TGCCCCAGCC CCTGGCTCCC TCTGTTGATC CCGGCCCCTG 3 6 00 

CTCCAGGCCT CACTGTGCAA CTGCTGCTGT CACTGCTGCT TCTGGTGCCT GTCCATCCCC 3660

AGAGGTTGCC CCGGATGCAG GAGGATTCCC CCTTGGGAGG AGGCTCTTCT GGGGAAGATG 3 72 0 

ACCCACTGGG CGAGGAGGAT CTGCCCAGTG AAGAGGATTC ACCCAGAGAG GAGGATCCAC 3780

CCGGAGAGGA GGATCTACCT GGAGAGGAGG ATCTACCTGG AGAGGAGGAT CTACCTGAAG 3840






TTAAGCCTAA ATCAGAAGAA GAGGGCTCCC TGAAGTTAGA GGATCTACCT ACTGTTGAGG 3 90 0 

CTCCTGGAGA TCCTCAAGAA CCCCAGAATA ATGCCCACAG GGACAAAGAA GGTAAGTGGT 3960

CATCAATCTC CAAATCCAGG TTCCAGGAGG TTCATGACTC CCCTCCCATA CCCCAGCCTA 4020

GGCTCTGTTC ACTCAGGGAA GGAGGGGAGA CTGTACTCCC CACAGAAGCC CTTCCAGAGG 4 08 0 

TCCCATACCA ATATCCCCAT CCCGACTCTC GGAGGTAGAA AGGGACAGAT GTGGAGAGAA 414 0 

AATAAAAAGG GTGCAAAAGG AGAGAGGTGA GCTGGATGAG ATGGGAGAGA AGGGGGAGGC 42 0 0 

TGGAGAAGAG AAAGGGATGA GAACTGCAGA TGAGAGAAAA AATGTGCAGA CAGAGGAAAA 42 60 

AAATAGGTGG AGAAGGAGAG TCAGAGAGTT TGAGGGGAAG AGAAAAGGAA AGCTTGGGAG 4320

GTGAAGTGGG TACCAGAGAC AAGCAAGAAG AGCTGGTAGA AGTCATCTCA TCTTAGGCTA 4380

CAATGAGGAA TTGAGACCTA GGAAGAAGGG ACACAGCAGG TAGAGAAACG TGGCTTCTTG 444 0 

ACTCCCAAGC CAGGAATTTG GGGAAAGGGG TTGGAGACCA TACAAGGCAG AGGGATGAGT 4500

GGGGAGAAGA AAGAAGGGAG AAAGGAAAGA TGGTGTACTC ACTCATTTGG GACTCAGGAC 4560 

TGAAGTGCCC ACTCACTTTT TTTTTTTTTT TTTTTGAGAC AAACTTTCAC TTTTGTTGCC 4 620 

CAGGCTGGAG TGCAATGGCG CGATCTCGGC TCACTGCAAC CTCCACCTCC CGGGTTCAAG 4680

TGATTCTCCT GCCTCAGCCT CTAGCCAAGT AGCTGCGATT ACAGGCATGC GCCACCACGC 4 74 0 

CCGGCTAATT TTTGTATTTT TAGTAGAGAC GGGGTTTCGC CATGTTGGTC AGGCTGGTCT 4 8 00 

CGAACTCCTG ATCTCAGGTG ATCCAACCAC CCTGGCCTCC CAAAGTGCTG GGATTATAGG 4860

CGTGAGCCAC AGCGCCTGGC CTGAAGCAGC CACTCACTTT TACAGACCCT AAGACAATGA 4920

TTGCAAGCTG GTAGGATTGC TGTTTGGCCC ACCCAGCTGC GGTGTTGAGT TTGGGTGCGG 4980

TCTCCTGTGC TTTGCACCTG GCCCGCTTAA GGCATTTGTT ACCCGTAATG CTCCTGTAAG 5040

GCATCTGCGT TTGTGACATC GTTTTGGTCG CCAGGAAGGG ATTGGGGCTC TAAGCTTGAG 510 0 

CGGTTCATCC TTTTCATTTA TACAGGGGAT GACCAGAGTC ATTGGCGCTA TGGAGGTGAG 516 0 

ACACCCACCC GCTGCACAGA CCCAATCTGG GAACCCAGCT CTKTGGATCT CCCCTACAGC 522 0 



AGCTAATTTT TTTTTGTATtX TTTAGTAGAC AGGGTTTCAC CATGTTGGTC AGGCTGGTCT 66 60 

CAAACTCCTG GCCTCAAGTG aVcCGCCTGA CTCAGCCTAC CAAAGTGCTG ATTACAAGTG 672 0 

TGAGCCACCG TGCCCAGCCA CAGTCACTGA TTCTTTAATG CCAGCCACAC AGCACAAAGT 67 8 0 

TCAGAGAAAT GCCTCCATCA TAGGATGTCA ATATGTTCAT ACTCTTAGGT TCATGATGTT 6840 

CTTAACATTA GGTTCATAAG CAAAATAAGA AAAAAGAATA ATAAATAAAA GAAGTGGCAT 6900 

GTCAGGACCT CACCTGAAAA GCCAAAGACA GAATCATGAA GGTGAATGCA GAGGTGACAC 69 60 

CAACACAAAG GTGTATATAT GGTTTCCTKST GGGGAGTATG TACGGAGGCA GCAGTGAGTG 702 0 

AGACTGCAAA CGTCAGAAGG GCACGGGT OA CTGAGAGCCT AGTATCCTAG TAAAGTGGGC 70 80 

TCTCTCCCTC TCTCTCCAGC TTGTCATTGA AAACCAGTCC ACCAAGCTTG TTGGTTCGCA 714 0 

CAGCAAGAGT ACATAGAGTT TGAAATAATa\ CATAGGATTT TAAGAGGGAG ACACTGTCTC 72 0 0 

TAAAAAAAAA AACAACAGCA ACAACAAAAA GCAACAACCA TTACAATTTT ATGTTCCCTC 72 60 

AGCATTCTCA GAGCTGAGGA ATGGGAGAGG ACTATGGGAA CCCCCTTCAT GTTCCGGCCT 732 0 

TCAGCCATGG CCCTGGATAC ATGCACTCAT CTGTCTTACA ATGTCATTCC CCCAGGAGGG 73 8 0 

CCCGGAAGAA AACAGTGCCT ATGAGCAGTT GCTGTCTCGC TTGGAAGAAA TCGCTGAGGA 744 0 

AGGTCAGTTT GTTGGTCTGG CCACTAATCT CTGCTGGCCTA GTTCATAAAG AATCACCCTT 75 00 

TGGAGCTTCA GGTCTGAGGC TGGAGATGGG CTCQCTCCAG TGCAGGAGGG ATTGAAGCAT 7 560 

GAGCCAGCGC TCATCTTGAT AATAAC CATG AAGCTGACAG ACACAGTTAC CCGCAAACGG 762 0 

CTGCCTACAG ATTGAAAACC AAGCAAAAAC CGCCGGGCAC GGTGGCTCAC GCCTGTAATC 768 0 

CCAGCACTTT GGGAGGCCAA GGCAGGTGGA TCACGA^GTC AAGAGATCAA GACCATCCTG 774 0 

GCCAACATGG TGAAACCCCA TCTCTACTAA AAATACGAAA AAATAGCCAG GCGTGGTGGC 780 0 

GGGTGCCTGT AATCCCAGCT ACTCGGGAGG CTGAGGCAGG AGAATGGCAT GAACC CGGGA 7 86 0 

GGCAGAAGTT GCAGTGAGCC GAGATCGTGC CACTGCACTC CAGCCTGGGC AACAGAGCGA 792 0 

GACTCTTGTC TCAAAAAAAA AAAAAAAAAA GAAAACCAAG CAAAAACCAA AATGAGACAA 798 0 



AAAAAACAAG ACCAAAAAAT GGTGTTTGGA AATTGTCAAG GTCAAGTCTG GAGAGCTAAA 804 0 

CTTTTTCTGA GAACTGTTTA TcWtAATAA GCATCAAATA TTTTAACTTT GTAAATACTT 810 0 

TTGTTGGAAA TCGTTCTCTT CTTAGTCACT CTTGGGTCAT TTTAAATCTC ACTTACTCTA 816 0 

CTAGACCTTT TAGGTTTCTG CTAGACTAGG TAGAACTCTG CCTTTGCATT TCTTGTGTCT 822 0 

GTTTTGTATA GTTATCAATA TTCATATTTA TTTACAAGTT ATTCAGATCA TTTTTTCTTT 828 0 

TCTTTTTTTT TTTTTTTTTT TTTTTTAEAT CTTTAGTAGA GACAGGGTTT CACCATATTG 834 0 

GCCAGGCTGC TCTCAAACTC CTGACCTTGT GATCCACCAG CCTCGGCCTC CCAAAGTGCT 840 0 

GGGATTCATT TTTTCTTTTT AATTTGCTCT GGGC TTAAAC TTGTGGCCCA GCACTTTATG 846 0 

ATGGTACACA GAGTTAAGAG TGTAGACTCa\gACGGTCTTT CTTCTTTCCT TCTCTTCCTT 852 0 

CCTCCCTTCC CTCCCACCTT CCCTTCTCTC GTTCCTTTCT TTCTTCCTCT CTTGCTTCCT 858 0 

CAGGCCTCTT CCAGTTGCTC CAAAGCCCTG TACTTTTTTT TGAGTTAACG TCTTATGGGA 864 0 

AGGGCCTGCA CTTAGTGAAG AAGTGGTCTC AGAlGTTGAGT TACCTTGGCT TCTGGGAGGT 870 0 

GAAACTGTAT CCCTATACCC TGAAGCTTTA AGGGGGTGCA ATGTAGATGA GACCCCAACA 8 76 0 

TAGATCCTCT TCACAGGCTC AGAGACTCAG GTCCCAGGAC TGGACATATC TGCACTCCTG 8820

CCCTCTGACT TCAGCCGCTA CTTCCAATAT GAGGGGTCTC TGACTACACC GCCCTGTGCC 8880

CAGGGTGTCA TCTGGACTGT GTTTAACCAG ACAGTGATGC TGAGTGCTAA GCAGGTGGGC 894 0 

CTGGGGTGTG TGTGGACACA GTGGGTGCGG GGGAAAGAGG ATGTAAGATG AGATGAGAAA 900 0 

CAGGAGAAGA AAGAAATCAA GGCTGGGCTC TGTGGCTTAC GCCTATAATC CCACCACGTT 90 60 

GGGAGGCTGA GGTGGGAGAA TGGTTTGAGC C C AGGAGT'lC AAGACAAGGC GGGGCAACAT 912 0 

AGTGTGACCC CATCTCTACC AAAAAAACCC CAACAAAAC u AAAAAT AGC C GGGCATGGTG 9180 

GTATGCGGCC TAGTCCCAGC TACTCAAGGA GGCTGAGGTG \GGAAGATCGC TTGATTCCAG 924 0 

GAGT TTGAGA CTGCAGTGAG CTATGATCCC ACCACTGCCT ACCATCTTTA GGATACATTT 93 00 

ATTTATTTAT AAAAGAAATC AAGAGGCTGG ATGGGGAATA CAGGAGCTGG AGGGTGGAGC 93 60 



CCTGAGGTGC TGGTTGTGAG CTGGCCTGGG ACCCTTGTTT CCTGTCATGC CATGAACCCA 942 0 

CCCACACTGT CCACTGACCT CCCTAGCTCC ACACCCTCTC TGACACCCTG TGGGGACCTG 94 8 0 

GTGACTCTCG GCTACAGCTG AACTTCCGAG CGACGCAGCC TTTGAATGGG CGAGTGATTG 9 540 

AGGCCTCCTT CCCTGCTGGA GTGGACAGCA GTCCTCGGGC TGCTGAGCCA GGTACAGCTT 96 0 0 

TGTCTGGTTT CCCCCCAGCC AGTAGTCCQT TATCCTCCCA TGTGTGTGCC AGTGTCTGTC 9660 

ATTGGTGGTC ACAGCCCGCC TCTCACATCT CCTTTTTCTC TCCAGTCCAG CTGAATTCCT 9720

GCCTGGCTGC TGGTGAGTCT GCCCCTCCTC TTGGTCCTGA TGCCAGGAGA CTCCTCAGCA 9780

CCATTCAGCC CCAGGGCTGC TCAGGACCGC GTCTGCTCCC TCTCCTTTTC TGCAGAACAG 984 0 

ACCCCAACCC CAATATTAGA GAGGCAGATC ATGGTGGGGA TTCCCCCATT GTCCCCAGAG 99 0 0 

GC TAATTGAT TAGAATGAAG CTTGAGAAAT CTffCCAGCAT CCCTCTCGCA AAAGAATCCC 99 60 

CCCCCCTTTT TTTAAAGATA GGGTCTCACT CTGTTTGCCC CAGGCTGGGG TGTTGTGGCA 1002 0 

CGATCATAGC TCACTGCAGC CTCGAACTCC TAGGCTCAGG CAATCCTTTC ACCTTAGCTT 100 80 

CTCAAAGCAC TGGGACTGTA GGCATGAGCC ACTGTGCCTG GCCCCAAACG GCCCTTTTAC 1014 0 

TTGGCTTTTA GGAAGCAAAA ACGGTGCTTA TCTTACGCCT TCTCGTGTAT CCACCCTCAT 10200 

CCCTTGGCTG GCCTCTTCTG GAGACTGAGG CACTATGGGG CTGCCTGAGA ACTCGGGGCA 10260 

GGGGTGGTGG AGTGCACTGA GGCAGGTGTT GAGGAACTQT GCAGACCCCT CTTCCTTCCC 10320 

AAAGCAGCCC TCTCTGCTCT CCATCGCAGG TGACATCCTA GCCCTGGTTT TTGGCCTCCT 10380

TTTTGCTGTC ACCAGCGTCG CGTTCCTTGT GCAGATGAGA AGGCAGCACA GGTATTACAC 10440

TGACCCTTTC TTCAGGCACA AGCTTCCCCC AC CCTTGTGG a)gJTCACTTCA TGCAAAGCGC 10500 

ATGCAAATGA GCTGCTCCTG GGCCAGTTTT CTGATTAGCC TTITCCTGTTG TGTACACACA 10560 

GAAGGGGAAC CAAAGGGGGT GTGAGCTACC GCCCAGCAGA GGTAGCCGAG ACTGGAGCCT 10620

AGAGGCTGGA TCTTGGAGAA TGTGAGAAGC CAGCCAGAGG CATCTGAGGG GGAGCCGGTA 10680

ACTGTCCTGT CCTGCTCATT ATGCCACTTC CTTTTAACTG CCAAGAAATT TTTTAAAATA 10740



AATATTTATA ATAAAATATG TGTTAGTCAC CTTTGTTCCC CAAATCAGAA GGAGGTATTT 108 00 



GAATTTCCTA TTACTGTTAT TAQCACCAAT TTAGTGGTAA TGCATTTATT CTATTACAGT 108 60 
TCGGCCTCCT TCCACACATC ACTCCAATGT GTTGCTCC 10898

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: Signal peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu Ile Pro Ala
1 5 10 15

Pro Ala Pro Gly Leu Thr Val Gln Leu Leu Leu Ser Leu Leu Leu Leu
20 25 30

Met Pro Val His Pro 
35 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer"

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TGGGGTTCTT GAGGATCTCC AGGAG 
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer"

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CTCTAACTTC AGGGAGCCCT CTTCTT 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(A) DESCRIPTION: /desc = "primer"

(iii) HYPOTHETICAL: NO

(ix) FEATURE:

(D) OTHER INFORMATION: N stands for inosine

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:



CUACUACUAC UAGGCCACGC GCTCGACTAGT AC GGGNNGGG NNGGGNNG 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:



Glu Glu Asp Leu Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(ix) FEATURE:

(A) NAME/KEY: Peptide

(B) LOCATION: 55..60

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Gly Glu Asp Asp Pro Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Asn Asn Ala His Arg Asp Lys Glu Gly Asp Asp Gln Ser His Trp Arg
1 5 10 15

Tyr Gly Gly Asp Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(ix) FEATURE:

(A) NAME/KEY: Peptide

(B) LOCATION: 36..51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

His Pro Gln Arg Leu Pro Arg Met Gln Glu Asp Ser Pro Leu Gly Gly
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu
1 5 10 15

Pro Gly Glu Glu Asp Leu Pro Gly
20 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(ix) FEATURE:

(A) NAME/KEY: Peptide

(B) LOCATION: 279..291

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gln
1 5 10

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Arg Arg Gln His Arg Arg Gly Thr Lys Gly Gly Val Ser Tyr Arg
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GTCGCTAGCT CCATGGGTCA TATGCAGAGG TTGCCCCGGA TGCAG
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GAAGATCTCT TACTCGAGCA TTCTCCAAGA TCCAGCCTCT AGG
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: AP-2 transcription factor

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

TCCCCCACCC 

(2) INFORMATION FOR SEQ ID NO : 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: initiator (Inr) element

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



CCACCCCCAT 



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: p53 binding site

(x) PUBLICATION INFORMATION:

(A) AUTHORS: El Deiry et al.

(B) TITLE: "Human genomic DNA sequences define a

consensus binding site for p53"

(C) JOURNAL: Nature Genetics

(D) VOLUME: 1

(F) PAGES: 44-49

(G) DATE: 1992

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
AAGCTAGTCC 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Leu Glu His His His His His His 
1 5 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: Initiator consensus sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
YYYCAYYYYY 



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: p53 binding site

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(x) PUBLICATION INFORMATION:

(A) AUTHORS: El Deiry et al.

(B) TITLE: "Human genomic DNA sequences define a

consensus binding site for p53"

(C) JOURNAL: Nature Genetics

(D) VOLUME: 1

(F) PAGES: 44-49

(G) DATE: 1992

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
AGGCTTGCTC 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
Ser Pro Xaa Xaa 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Thr Pro Xaa Xaa 
1 



(2) INFORMATION FOR SEQ ID NO : 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(A) DESCRIPTION: Proposed MN promoter 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CTTGCTTTTC ATTCAAGCTC AAGTTTGTCT GCCACATACC CATTACTTAA CTCACCCTCG 60

GGCTCCCCTA GCAGCCTGCC CTACCTCTTT ACCTGCTTCC TGGTGGAGTC AGGGATGTAT 120

ACATGAGCTG CTTTCCCTCT CAGCCAGAGG AGATGGGGGG CCCCAGCTCC CCTGCCTTTC 180

CCCTTCTGTG CCTGGAGCTG GGAAGCAGGC CAGGGTTAGC TGAGGCTGGC TGGCAAGCAG 240

CTGGGTGGTG CCAGGGAGAG CCTGCATAGT GCCAGGTGGT GCCTTGGGTT CCAAGCTAGT 300

CCATGGCCCC GATAACCTTC TGCCTGTGCA CACSCCTGCC CCTCACTCCA CCCCCATCCT 360 

AGCTTTGGTA TGGGGGAGAG GGCACAGGGC CAGACAAACC TGTGAGACTT TGGCTCCATC 420

TCTGCAAAAG GGCGCTCTGT GAGTCAGCCT GCTCCCCTCC AGGCTTGCTC CTCCCCCACC 480

CAGCTCTCGT TTCCAATGCA CGTACAGCCC GTACACACCG TGTGCTGGGA CACCCCACAG 540



(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 1st MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
ACAGTCAGCC GCATGGCTCC CCTGTGCCCC AGCCCCTGGC TCCCTCTGTT GATCCCGGCC 60
CCTGCTCCAG GCCTCACTGT GCAACTGCTG CTGTCACTGC TGCTTCTGGT GCCTGTCCAT 120
CCCCAGAGGT TGCCCCGGAT GCAGGAGGAT TCCCCCTTGG GAGGAGGCTC TTCTGGGGAA 180
GATGACCCAC TGGGCGAGGA GGATCTGCCC AGTGAAGAGG ATTCACCCAG AGAGGAGGAT 240
CCACCCGGAG AGGAGGATCT ACCTGGAGAG GAGGATCTAC CTGGAGAGGA GGATCTACCT 300
GAAGTTAAGC CTAAATCAGA AGAAGAGGGC TCCCTGAAGT TAGAGGATCT ACCTACTGTT 360
GAGGCTCCTG GAGATCCTCA AGAACCCGAG AATAATGCCC ACAGGGACAA AGAAG 415
(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: 2nd MN exon
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
GGGATGACCA GAGTCATTGG CGCTATGGAG
(2) INFORMATION FOR SEQ ID NO : 30: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3rd MN exon
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
GCGACCCGCC CTGGCCCCGG GTGTCCCCAG CCTGCGCGGG CCGCTTCCAG TCCCCGGTGG 60
ATATCCGCCC CCAGCTCGCC GCCTTCTGCC CGGCCCTGCG CCCCCTGGAA CTCCTGGGCT 120
TCCAGCTCCC GCCGCTCCCA GAACTGCGCC TGCGCAACAA TGGCCACAGT G 171
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: 4th MN exon
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
TGCAACTGAC CCTGCCTCCT GGGCTAGAGA TGGCTCTGGG TCCCGGGCGG GAGTACCGGG 60
CTCTGCAGCT GCATCTGCAC TGGGGGGCTG CAGGTCGTCC GGGCTCGGAG CACACTGTGG 120
AAGGCCACCG TTTCCCTGCC GAG 143



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: 5th MN exon
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
ATCCACGTGG TTCACCTCAG CACCGCCTTT GCCAGAGTTG ACGAGGCCTT GGGGCGCCCG 60
GGAGGCCTGG CCGTGTTGGC CGCCTTTCTG GAG 93
(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 67 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: 6th MN exon

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
GAGGGCCCGG AAGAAAACAG TGCCTATGAG CAGTTGCTGT CTCGCTTGGA AGAAATCGCT 60

GAGGAAG 67



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 7th MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GCTCAGAGAC TCAGGTCCCA GGACTGGACA TATCTGCACT CCTGCCCTCT GACTTCAGCC 
GCTACTTCCA ATATGAGGGG TCTCTGACTA CACCGCCCTG TGCCCAGGGT GTCATCTGGA 
CTGTGTTTAA CCAGACAGTG ATGCTGAGTG CTAAGCAG 
(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 8th MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



CTCCACACCC TCTCTGACAC CCTGTGGGGA CCTGGTGACT CTCGGCTACA GCTGAACTTC 
CGAGCGACGC AGCCTTTGAA TGGGCGAGTG ATTGAGGCCT CCTTCCCTGC TGGAGTGGAC 
AGCAGTCCTC GGGCTGCTGA GCCAG 
(2) INFORMATION FOR SEQ ID NO : 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 9th MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TCCAGCTGAA TTCCTGCCTG GCTGCTG 
(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 10th MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

GTGACATCCT AGCCCTGGTT TTTGGCCTCC TTTTTGCTGT CACCAGCGTC GCGTTCCTTG 60

TGCAGATGAG AAGGCAGCAC AG 82
(2) INFORMATION FOR SEQ ID NO : 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 11th MN exon 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
AAGGGGAACC AAAGGGGGTG TGAGCTACCG CCCAGCAGAG GTAGCCGAGA CTGGAGCCTA 60

GAGGCTGGAT CTTGGAGAAT GTGAGAAGCC AGCCAGAGGC ATCTGAGGGG GAGCCGGTAA 120

CTGTCCTGTC CTGCTCATTA TGCCACTTCC TTTTAACTGC CAAGAAATTT TTTAAAATAA 180

ATATTTATAA T 191
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 1st MN intron 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

GTAAGTGGTC ATCAATCTCC AAATCCAGGT TCCAGGAGGT TCATGACTCC CCTCCCATAC 60
CCCAGCCTAG GCTCTGTTCA CTCAGGGAAG GAGGGGAGAC TGTACTCCCC ACAGAAGCCC 120
TTCCAGAGGT CCCATACCAA TATCCCCATC CCCACTCTCG GAGGTAGAAA GGGACAGATG 180
TGGAGAGAAA ATAAAAAGGG TGCAAAAGGA GAGAGGTGAG CTGGATGAGA TGGGAGAGAA 240
GGGGGAGGCT GGAGAAGAGA AAGGGATGAG AACTGCAGAT GAGAGAAAAA ATGTGCAGAC 300
AGAGGAAAAA AATAGGTGGA GAAGGAGAGT CAGAGAGTTT GAGGGGAAGA GAAAAGGAAA 360
GCTTGGGAGG TGAAGTGGGT ACCAGAGACA AGCAAGAAGA GCTGGTAGAA GTCATCTCAT 420
CTTAGGCTAC AATGAGGAAT TGAGACCTAG GAAGAAGGGA CACAGCAGGT AGAGAAACGT 480
GGCTTCTTGA CTCCCAAGCC AGGAATTTGG GGAAAGGGGT TGGAGACCAT ACAAGGCAGA 540
GGGATGAGTG GGGAGAAGAA AGAAGGGAGA AAGGAAAGAT GGTGTACTCA CTCATTTGGG 600
ACTCAGGACT GAAGTGCCCA CTCACTTTTT TTTTTTTTTT TTTTGAGACA AACTTTCACT 660
TTTGTTGCCC AGGCTGGAGT GCAATGGCGC GATCTCGGCT CACTGCAACC TCCACCTCCC 720
GGGTTCAAGT GATTCTCCTG CCTCAGCCTC TAGCCAAGTA GCTGCGATTA CAGGCATGCG 780
CCACCACGCC CGGCTAATTT TTGTATTTTT AGTAGAGACG GGGTTTCGCC ATGTTGGTCA 840
GGCTGGTCTC GAACTCCTGA TCTCAGGTGA TCCAACCACC CTGGCCTCCC AAAGTGCTGG 900
GATTATAGGC GTGAGCCACA GCGCCTGGCC TGAAGCAGCC ACTCACTTTT ACAGACCCTA 960
AGACAATGAT TGCAAGCTGG TAGGATTGCT GTTTGGCCCA CCCAGCTGCG GTGTTGAGTT 1020
TGGGTGCGGT CTCCTGTGCT TTGCACCTGG CCCGCTTAAG GCATTTGTTA CCCGTAATGC 1080
TCCTGTAAGG CATCTGCGTT TGTGACATCG TTTTGGTCGC CAGGAAGGGA TTGGGGCTCT 1140
AAGCTTGAGC GGTTCATCCT TTTCATTTAT ACAG 1174



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 2nd MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTGAGACACC CACCCGCTGC ACAGACCCAA TCTGGGAACC CAGCTCTGTG GATCTCCCCT 
ACAGCCGTCC CTGAACACTG GTCCCGGGCG TCCCACCCGC CGCCCACCGT CCCACCCCCT 
CACCTTTTCT ACCCGGGTTC CCTAAGTTCC TGACCTAGGC GTCAGACTTC CTCACTATAC 
TCTCCCACCC CAG 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3rd MN intron 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

GTGAGGGGGT CTCCCCGCCG AGACTTGGGG ATGGGGCGGG GCGCAGGGAA GGGAACCGTC 60
GCGCAGTGCC TGCCCGGGGG TTGGGCTGGC CCTACCGGGC GGGGCCGGCT CACTTGCCTC 120
TCCCTACGCA G 131
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 4th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

GTGAGCGCGG ACTGGCCGAG AAGGGGCAAA GGAGCGGGGC GGACGGGGGC CAGAGACGTG 60
GCCCTCTCCT ACCCTCGTGT CCTTTTCAG 89

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(A) DESCRIPTION: 5th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

GTACCAGATC CTGGACACCC CCTACTCCCC GCTTTCCCAT CCCATGCTCC TCCCGGACTC 60
TATCGTGGAG CCAGAGACCC CATCCCAGCA AGCTCACTCA GGCCCCTGGC TGACAAACTC 120
ATTCACGCAC TGTTTGTTCA TTTAACACCC ACTGTGAACC AGGCACCAGC CCCCAACAAG 180
GATTCTGAAG CTGTAGGTCC TTGCCTCTAA GGAGCCCACA GCCAGTGGGG GAGGCTGACA 240
TGACAGACAC ATAGGAAGGA CATAGTAAAG ATGGTGGTCA CAGAGGAGGT GACACTTAAA 300
GCCTTCACTG GTAGAAAAGA AAAGGAGGTG TTCATTGCAG AGGAAACAGA ATGTGCAAAG 360
ACTCAGAATA TGGCCTATTT AGGGAATGGC TACATACACC ATGATTAGAG GAGGCCCAGT 420
AAAGGGAAGG GATGGTGAGA TGCCTGCTAG GTTCACTCAC TCACTTTTAT TTATTTATTT 480
ATTTTTTTGA CAGTCTCTCT GTCGCCCAGG CTGGAGTGCA GTGGTGTGAT CTTGGGTCAC 540
TGCAACTTCC GCCTCCCGGG TTCAAGGGAT TCTCCTGCCT CAGCTTCCTG AGTAGCTGGG 600
GTTACAGGTG TGTGCCACCA TGCCCAGCTA ATTTTTTTTT GTATTTTTAG TAGACAGGGT 660
TTCACCATGT TGGTCAGGCT GGTCTCAAAC TCCTGGCCTC AAGTGATCCG CCTGACTCAG 720
GCTACCAAAG TGCTGATTAC AAGTGTGAGC CACCGTGCCC AGCCACACTC ACTGATTCTT 780
TAATGCCAGC CACACAGCAC AAAGTTCAGA GAAATGCCTC CATCATAGCA TGTCAATATG 840
TTCATACTCT TAGGTTCATG ATGTTCTTAA CATTAGGTTC ATAAGCAAAA TAAGAAAAAA 900
GAATAATAAA TAAAAGAAGT GGCATGTCAG GACCTCACCT GAAAAGCCAA ACACAGAATC 960
ATGAAGGTGA ATGCAGAGGT GACACCAACA CAAAGGTGTA TATATGGTTT CCTGTGGGGA 1020
GTATGTACGG AGGCAGCAGT GAGTGAGACT GCAAACGTCA GAAGGGCACG GGTCACTGAG 1080
AGCCTAGTAT CCTAGTAAAG TGGGCTCTCT CCCTCTCTCT CCAGCTTGTC ATTGAAAACC 1140
AGTCCACCAA GCTTGTTGGT TCGCACAGCA AGAGTACATA GAGTTTGAAA TAATACATAG 1200
GATTTTAAGA GGGAGACACT GTCTCTAAAA AAAAAAACAA CAGCAACAAC AAAAAGCAAC 1260
AACCATTACA ATTTTATGTT CCCTCAGCAT TCTCAGAGCT GAGGAATGGG AGAGGACTAT 1320
GGGAACCCCC TTCATGTTCC GGCCTTCAGC CATGGCCCTG GATACATGCA CTCATCTGTC 1380
TTACAATGTC ATTCCCCCAG 1400
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1334 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 6th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

GTCAGTTTGT TGGTCTGGCC ACTAATCTCT GTGGCCTAGT TCATAAAGAA TCACCCTTTG 60
GAGCTTCAGG TCTGAGGCTG GAGATGGGCT CCCTCCAGTG CAGGAGGGAT TGAAGCATGA 120
GCCAGCGCTC ATCTTGATAA TAACCATGAA GCTGACAGAC ACAGTTACCC GCAAACGGCT 180
GCCTACAGAT TGAAAACCAA GCAAAAACCG CCGGGCACGG TGGCTCACGC CTGTAATCCC 240
AGCACTTTGG GAGGCCAAGG CAGGTGGATC ACGAGGTCAA GAGATCAAGA CCATCCTGGC 300
CAACATGGTG AAACCCCATC TCTACTAAAA ATACGAAAAA ATAGCCAGGC GTGGTGGCGG 360
GTGCCTGTAA TCCCAGCTAC TCGGGAGGCT GAGGCAGGAG AATGGCATGA ACCCGGGAGG 420
CAGAAGTTGC AGTGAGCCGA GATCGTGCCA CTGCACTCCA GCCTGGGCAA CAGAGCGAGA 480
CTCTTGTCTC AAAAAAAAAA AAAAAAAAGA AAACCAAGCA AAAACCAAAA TGAGACAAAA 540
AAAACAAGAC CAAAAAATGG TGTTTGGAAA TTGTCAAGGT CAAGTCTGGA GAGCTAAACT 600
TTTTCTGAGA ACTGTTTATC TTTAATAAGC ATCAAATATT TTAACTTTGT AAATACTTTT 660
GTTGGAAATC GTTCTCTTCT TAGTCACTCT TGGGTCATTT TAAATCTCAC TTACTCTACT 720
AGACCTTTTA GGTTTCTGCT AGACTAGGTA GAACTCTGCC TTTGCATTTC TTGTGTCTGT 780
TTTGTATAGT TATCAATATT CATATTTATT TACAAGTTAT TCAGATCATT TTTTCTTTTC 840
TTTTTTTTTT TTTTTTTTTT TTTTACATCT TTAGTAGAGA CAGGGTTTCA CCATATTGGC 900
CAGGCTGCTC TCAAACTCCT GACCTTGTGA TCCACCAGCC TCGGCCTCCC AAAGTGCTGG 960
GATTCATTTT TTCTTTTTAA TTTGCTCTGG GCTTAAACTT GTGGCCCAGC ACTTTATGAT 1020
GGTACACAGA GTTAAGAGTG TAGACTCAGA CGGTCTTTCT TCTTTCCTTC TCTTCCTTCC 1080
TCCCTTCCCT CCCACCTTCC CTTCTCTCCT TCCTTTCTTT CTTCCTCTCT TGCTTCCTCA 1140
GGCCTCTTCC AGTTGCTCCA AAGCCCTGTA CTTTTTTTTG AGTTAACGTC TTATGGGAAG 1200
GGCCTGCACT TAGTGAAGAA GTGGTCTCAG AGTTGAGTTA CCTTGGCTTC TGGGAGGTGA 1260
AACTGTATCC CTATACCCTG AAGCTTTAAG GGGGTGCAAT GTAGATGAGA CCCCAACATA 1320
GATCCTCTTC ACAG 1334
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(A) DESCRIPTION: 7th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

GTGGGCCTGG GGTGTGTGTG GACACAGTGG GTGCGGGGGA AAGAGGATGT AAGATGAGAT 60
GAGAAACAGG AGAAGAAAGA AATCAAGGCT GGGCTCTGTG GCTTACGCCT ATAATCCCAC 120
CACGTTGGGA GGCTGAGGTG GGAGAATGGT TTGAGCCCAG GAGTTCAAGA CAAGGCGGGG 180
CAACATAGTG TGACCCCATC TCTACCAAAA AAACCCCAAC AAAACCAAAA ATAGCCGGGC 240
ATGGTGGTAT GCGGCCTAGT CCCAGCTACT CAAGGAGGCT GAGGTGGGAA GATCGCTTGA 300
TTCCAGGAGT TTGAGACTGC AGTGAGCTAT GATCCCACCA CTGCCTACCA TCTTTAGGAT 360
ACATTTATTT ATTTATAAAA GAAATCAAGA GGCTGGATGG GGAATACAGG AGCTGGAGGG 420
TGGAGCCCTG AGGTGCTGGT TGTGAGCTGG CCTGGGACCC TTGTTTCCTG TCATGCCATG 480
AACCCACCCA CACTGTCCAC TGACCTCCCT AG 512
(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 8th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:
GTACAGCTTT GTCTGGTTTC CCCCCAGCCA GTAGTCCCTT ATCCTCCCAT GTGTGTGCCA 60 
GTGTCTGTCA TTGGTGGTCA CAGCCCGCCT CTCACATCTC CTTTTTCTCT CCAG 114 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 617 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 9th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

GTGAGTCTGC CCCTCCTCTT GGTCCTGATG CCAGGAGACT CCTCAGCACC ATTCAGCCCC 60
AGGGCTGCTC AGGACCGCCT CTGCTCCCTC TCCTTTTCTG CAGAACAGAC CCCAACCCCA 120
ATATTAGAGA GGCAGATCAT GGTGGGGATT CCCCCATTGT CCCCAGAGGC TAATTGATTA 180
GAATGAAGCT TGAGAAATCT CCCAGCATCC CTCTCGCAAA AGAATCCCCC CCCCTTTTTT 240
TAAAGATAGG GTCTCACTCT GTTTGCCCCA GGCTGGGGTG TTGTGGCACG ATCATAGCTC 300
ACTGCAGCCT CGAACTCCTA GGCTCAGGCA ATCCTTTCAC CTTAGCTTCT CAAAGCACTG 360
GGACTGTAGG CATGAGCCAC TGTGCCTGGC CCCAAACGGC CCTTTTACTT GGCTTTTAGG 420
AAGCAAAAAC GGTGCTTATC TTACCCCTTC TCGTGTATCC ACCCTCATCC CTTGGCTGGC 480
CTCTTCTGGA GACTGAGGCA CTATGGGGCT GCCTGAGAAC TCGGGGCAGG GGTGGTGGAG 540
TGCACTGAGG CAGGTGTTGA GGAACTCTGC AGACCCCTCT TCCTTCCCAA AGCAGCCCTC 600
TCTGCTCTCC ATCGCAG 617



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 10th MN intron 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

GTATTACACT GACCCTTTCT TCAGGCACAA GCTTCCCCCA CCCTTGTGGA GTCACTTCAT 60
GCAAAGCGCA TGCAAATGAG CTGCTCCTGG GCCAGTTTTC TGATTAGCCT TTCCTGTTGT 120
GTACACACAG 130
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: Spans 3' part of 1st intron to beyond 
end of 5th exon 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

CAAACTTTCA CTTTTGTTGC CCAGGCTGGA GTGCAATGGC GCGATCTCGG CTCACTGCAA 60
CCTCCACCTC CCGGGTTCAA GTGATTCTCC TGCCTCAGCC TCTAGCCAAG TAGCTGCGAT 120
TACAGGCATG CGCCACCACG CCCGGCTAAT TTTTGTATTT TTAGTAGAGA CGGGGTTTCG 180
CCATGTTGGT CAGGCTGGTC TCGAACTCCT GATCTCAGGT GATCCAACCA CCCTGGCCTC 240
CCAAAGTGCT GGGATTATAG GCGTGAGCCA CAGCGCCTGG CCTGAAGCAG CCACTCACTT 300
TTACAGACCC TAAGACAATG ATTGCAAGCT GGTAGGATTG CTGTTTGGCC CACCCAGCTG 360
CGGTGTTGAG TTTGGGTGCG GTCTCCTGTG CTTTGCACCT GGCCCGCTTA AGGCATTTGT 420
TACCCGTAAT GCTCCTGTAA GGCATCTGCG TTTGTGACAT CGTTTTGGTC GCCAGGAAGG 480
GATTGGGGCT CTAAGCTTGA GCGGTTCATC CTTTTCATTT ATACAGGGGA TGACCAGAGT 540
CATTGGCGCT ATGGAGGTGA GACACCCACC CGCTGCACAG ACCCAATCTG GGAACCCAGC 600
TCTGTGGATC TCCCCTACAG CCGTCCCTGA ACACTGGTCC CGGGCGTCCC ACCCGCCGCC 660
CACCGTCCCA CCCCCTCACC TTTTCTACCC GGGTTCCCTA AGTTCCTGAC CTAGGCGTCA 720
GACTTCCTCA CTATACTCTC CCACCCCAGG CGACCCGCCC TGGCCCCGGG TGTCCCCAGC 780
CTGCGCGGGC CGCTTCCAGT CCCCGGTGGA TATCCGCCCC CAGCTCGCCG CCTTCTGCCC 840
GGCCCTGCGC CCCCTGGAAC TCCTGGGCTT CCAGCTCCCG CCGCTCCCAG AACTGCGCCT 900
GCGCAACAAT GGCCACAGTG GTGAGGGGGT CTCCCCGCCG AGACTTGGGG ATGGGGCGGG 960
GCGCAGGGAA GGGAACCGTC GCGCAGTGCC TGCCCGGGGG TTGGGCTGGC CCTACCGGGC 1020
GGGGCCGGCT CACTTGCCTC TCCCTACGCA GTGCAACTGA CCCTGCCTCC TGGGCTAGAG 1080
ATGGCTCTGG GTCCCGGGCG GGAGTACCGG GCTCTGCAGC TGCATCTGCA CTGGGGGGCT 1140
GCAGGTCGTC CGGGCTCGGA GCACACTGTG GAAGGCCACC GTTTCCCTGC CGAGGTGAGC 1200
GCGGACTGGC CGAGAAGGGG CAAAGGAGCG GGGCGGACGG GGGCCAGAGA CGTGGCCCTC 1260
TCCTACCCTC GTGTCCTTTT CAGATCCACG TGGTTCACCT CAGCACCGCC TTTGCCAGAG 1320
TTGACGAGGC CTTGGGGCGC CCGGGAGGCC TGGCCGTGTT GGCCGCCTTT CTGGAGGTAC 1380
CAGATCCTGG ACACCCCCTA C 1401
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(A) DESCRIPTION: Region of homology to collagen alpha 
1 chain 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Gln Arg Leu Pro Arg Met Gln Glu Asp Ser Pro Leu Gly Gly Gly Ser
1 5 10 15

Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp Leu Pro Ser Glu Glu
20 25 30

Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu Glu Asp Leu Pro Gly
35 40 45

Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Glu Val Lys Pro Lys
50 55 60

Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp Leu Pro Thr Val Glu
65 70 75 80

Ala Pro Gly Asp Pro Gln Glu Pro Gln Asn Asn Ala His Arg Asp Lys
85 90 95

Glu Gly



(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(A) DESCRIPTION: carbonic anhydrase domain 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Asp Asp Gln Ser His Trp Arg Tyr Gly Gly Asp Pro Pro Trp Pro Arg
1 5 10 15

Val Ser Pro Ala Cys Ala Gly Arg Phe Gln Ser Pro Val Asp Ile Arg
20 25 30

Pro Gln Leu Ala Ala Phe Cys Pro Ala Leu Arg Pro Leu Glu Leu Leu
35 40 45

Gly Phe Gln Leu Pro Pro Leu Pro Glu Leu Arg Leu Arg Asn Asn Gly
50 55 60

His Ser Val Gln Leu Thr Leu Pro Pro Gly Leu Glu Met Ala Leu Gly
65 70 75 80

Pro Gly Arg Glu Tyr Arg Ala Leu Gln Leu His Leu His Trp Gly Ala
85 90 95

Ala Gly Arg Pro Gly Ser Glu His Thr Val Glu Gly His Arg Phe Pro
100 105 110

Ala Glu Ile His Val Val His Leu Ser Thr Ala Phe Ala Arg Val Asp
115 120 125

Glu Ala Leu Gly Arg Pro Gly Gly Leu Ala Val Leu Ala Ala Phe Leu
130 135 140

Glu Glu Gly Pro Glu Glu Asn Ser Ala Tyr Glu Gln Leu Leu Ser Arg
145 150 155 160

Leu Glu Glu Ile Ala Glu Glu Gly Ser Glu Thr Gln Val Pro Gly Leu
165 170 175

Asp Ile Ser Ala Leu Leu Pro Ser Asp Phe Ser Arg Tyr Phe Gln Tyr
180 185 190

Glu Gly Ser Leu Thr Thr Pro Pro Cys Ala Gln Gly Val Ile Trp Thr
195 200 205

Val Phe Asn Gln Thr Val Met Leu Ser Ala Lys Gln Leu His Thr Leu
210 215 220

Ser Asp Thr Leu Trp Gly Pro Gly Asp Ser Arg Leu Gln Leu Asn Phe
225 230 235 240

Arg Ala Thr Gln Pro Leu Asn Gly Arg Val Ile Glu Ala Ser Phe Pro
245 250 255



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: transmembrane region 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Asp Ile Leu Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val
1 5 10 15

Ala Phe Leu Val
20

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS:

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: intracellular C-terminus



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Arg Arg Gln His Arg Arg Gly Thr Lys Gly Gly Val Ser Tyr Arg
1 5 10 15

Pro Ala Glu Val Ala Glu Thr Gly Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Arg Ala Leu Gln Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly
1 5 10 15

Ser Glu His Thr Val Glu Gly His Arg Phe Pro Ala Glu Ile His Val
20 25 30

Val His Leu Ser Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg
35 40 45

Pro Gly Gly Leu Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu
50 55 60

Glu Asn Ser Ala Tyr Glu Gln Leu Leu Ser Arg Leu Glu Glu Ile Ala
65 70 75 80

Glu Glu Gly Ser Glu Thr Gln Val Pro Gly Leu Asp Ile Ser Ala Leu
85 90 95

Leu Pro Ser Asp Phe Ser Arg Tyr Phe Gln Tyr Glu Gly Ser Leu Thr
100 105 110

Thr Pro Pro Cys Ala Gln Gly Val Ile Trp Thr Val Phe Asn Gln Thr
115 120 125

Val Met Leu Ser Ala Lys Gln Leu His Thr Leu Ser Asp Thr Leu Trp
130 135 140

Gly Pro Gly Asp Ser Arg Leu Gln Leu Asn Phe Arg Ala Thr Gln Pro
145 150 155 160

Leu Asn Gly Arg Val Ile Glu Ala Ser Phe
165 170

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CAUGGCCCCG AUAACCUUCU GCCUGUGCAC ACACCUGCCC CUCACUCCAC CCCCAUCCUA 60
GCUUUGGUAU GGGGGAGAGG GCACAGGGCC AGACAAACCU GUGAGACUUU GGCUCCAUCU 120
CUGCAAAAGG GCGCUCUGUG AGUCAGCCUG CUCCCCUCCA GGCUUGCUCC UCCCCCACCC 180
AGCUCUCGUU UCCAAUGCAC GUACAGCCCG UACACACCGU GUGCUGGGAC ACCCCACAGU 240
CAGCCGCAUG GCUCCCCUGU GCCCCAGCCC CUGGCUCCCU CUGUUGAUCC CGGCCCCUGC 300
UCCAGGCCUC ACUGUGCAAC UGCUGCUGUC ACUGCUGCUU CUGGUGCCUG UCCAUCCCCA 360
GAGGUUGCCC CGGAUGCAGG AGGAUUCCCC CUUGGGAGGA GGCUCUUCUG GGGAAGAUGA 420
CCCACUGGGC GAGGAGGAUC UGCCCAGUGA AGAGGAUUCA CCCAGAGAGG 470
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

This sequence is intentionally skipped. 

(2) INFORMATION FOR SEQ ID NO: 57:
(i) SEQUENCE CHARACTERISTICS: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

This sequence is intentionally skipped. 

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

GCTGGTCTCG AACTCCTGGA CTCAAGCAAT CCACCCACCT CAGCCTCCCA AAATGAGGGA 60
CCGTGTCTTA TTCATTTCCA TGTCCCTAGT CCATAGCCCA GTGCTGGACC TATGGTAGTA 120
CTAAATAAAT ATTTGTTGAA TGCAATAGTA AATAGCATTT CAGGGAGCAA GAACTAGATT 180
AACAAAGGTG GTAAAAGGTT TGGAGAAAAA AATAATAGTT TAATTTGGCT AGAGTATGAG 240
GGAGAGTAGT AGGAGACAAG ATGGAAAGGT CTCTTGGGCA AGGTTTTGAA GGAAGTTGGA 300
AGTCAGAAGT ACACAATGTG CATATCGTGG CAGGCAGTGG GGAGCCAATG AAGGCTTTTG 360
AGCAGGAGAG TAATGTGTTG AAAAATAAAT ATAGGTTAAA CCTATCAGAG CCCCTCTGAC 420
ACATACACTT GCTTTTCATT CAAGCTCAAG TTTGTCTCCC ACATACCCAT TACTTAACTC 480
ACCCTCGGGC TCCCCTAGCA GCCTGCCCTA CCTCTTTACC TGCTTCCTGG TGGAGTCAGG 540
GATGTATACA TGAGCTGCTT TCCCTCTCAG CCAGAGGACA TGGGGGGCCC CAGCTCCCCT 600
GCCTTTCCCC TTCTGTGCCT GGAGCTGGGA AGCAGGCCAG GGTTAGCTGA GGCTGGCTGG 660
CAAGCAGCTG GGTGGTGCCA GGGAGAGCCT GCATAGTGCC AGGTGGTGCC TTGGGTTCCA 720
AGCTAGTCCA TGGCCCCGAT AACCTTCTGC CTGTGCACAC ACCTGCCCCT CACTCCACCC 780
CCATCCTAGC TTTGGTATGG GGGAGAGGGC ACAGGGCCAG ACAAACCTGT GAGACTTTGG 840
CTCCATCTCT GCAAAAGGGC GCTCTGTGAG TCAGCCTGCT CCCCTCCAGG CTTGCTCCTC 900
CCCC 904
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

TTTTTTTGAG ACGGAGTCTT GCATCTGTCA TGCCCAGGCT GGAGTAGCAG TGGTGCCATC 60
TCGGCTCACT GCAAGCTCCA CCTCCCGAGT TCACGCCATT TTCCTGCCTC AGCCTCCCGA 120
GTAGCTGGGA CTACAGGCGC CCGCCACCAT GCCCGGCTAA TTTTTTGTAT TTTTGGTAGA 180
GACGGGGTTT CACCGTGTTA GCCAGAATGG TCTCGATCTC CTGACTTCGT GATCCACCCG 240
CCTCGGCCTC CCAAAGTTCT GGGATTACAG GTGTGAGCCA CCGCACCTGG CC 292
(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TTCTTTTTTG AGACAGGGTC TTGCTCTGTC ACCCAGGCCA GAGTGCAATG GTACAGTCTC 
AGCTCACTGC AGCCTCAACC GCCTCGGCTC AAACCATCAT CCCATTTCAG CCTCCTGAGT 
AGCTGGGACT ACAGGCACAT GCCATTACAC CTGGCTAATT TTTTTGTATT TCTAGTAGAG 
ACAGGGTTTG GCCATGTTGC CCGGGCTGGT CTCGAACTCC TGGACTCAAG CAATCCACCC 
ACCTCAGCCT CCCAAAATGA GG 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

TTTTTTTTTG AGACAAACTT TCACTTTTGT TGCCCAGGCT GGAGTGCAAT GGCGCGATCT 60
CGGCTCACTG CAACCTCCAC CTCCCGGGTT CAAGTGATTC TCCTGCCTCA GCCTCTAGCC 120
AAGTAGCTGC GATTACAGGC ATGCGCCACC ACGCCCGGCT AATTTTTGTA TTTTTAGTAG 180
AGACGGGGTT TCGCCATGTT GGTCAGGCTG GTCTCGAACT CCTGATCTCA GGTGATCCAA 240
CCACCCTGGC CTCCCAAAGT GCTGGGATTA TAGGCGTGAG CCACAGCGCC TGGC 294
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

TGACAGTCTC TCTGTCGCCC AGGCTGGAGT GCAGTGGTGT GATCTTGGGT CACTGCAACT 60
TCCGCCTCCC GGGTTCAAGG GATTCTCCTG CCTCAGCTTC CTGAGTAGCT GGGGTTACAG 120
GTGTGTGCCA CCATGCCCAG CTAATTTTTT TTTGTATTTT TAGTAGACAG GGTTTCACCA 180
TGTTGGTCAG GCTGGTCTCA AACTCCTGGC CTCAAGTGAT CCGCCTGACT CAGCCTACCA 240
AAGTGCTGAT TACAAGTGTG AGCCACCGTG CCCAGC 276



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

CGCCGGGCAC GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCAA GGCAGGTGGA 60
TCACGAGGTC AAGAGATCAA GACCATCCTG GCCAACATGG TGAAACCCCA TCTCTACTAA 120
AAATACGAAA AAATAGCCAG GCGTGGTGGC GGGTGCCTGT AATCCCAGCT ACTCGGGAGG 180
CTGAGGCAGG AGAATGGCAT GAACCCGGGA GGCAGAAGTT GCAGTGAGCC GAGATCGTGC 240
CACTGCACTC CAGCCTGGGC AACAGAGCGA GACTCTTGTC TCAAAAAAA 289
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

AGGCTGGGCT CTGTGGCTTA CGCCTATAAT CCCACCACGT TGGGAGGCTG AGGTGGGAGA 60
ATGGTTTGAG CCCAGGAGTT CAAGACAAGG CGGGGCAACA TAGTGTGACC CCATCTCTAC 120
CAAAAAAACC CCAACAAAAC CAAAAATAGC CGGGCATGGT GGTATGCGGC CTAGTCCCAG 180
CTACTCAAGG AGGCTGAGGT GGGAAGATCG CTTGATTCCA GGAGTTTGAG ACTGCAGTGA 240
GCTATGATCC CACCACTGCC TACCATCTTT AGGATACATT TATTTATTTA TAAAAGAA 298
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

TTTTTTACAT CTTTAGTAGA GACAGGGTTT CACCATATTG GCCAGGCTGC TCTCAAACTC 60
CTGACCTTGT GATCCACCAG CCTCGGCCTC CCAAAGTGCT GGGAT 105



(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CCTCGAACTC CTAGGCTCAG GCAATCCTTT CACCTTAGCT TCTCAAAGCA CTGGGACTGT 
AGGCATGAGC CACTGTGCCT GGC 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AGAAGGTAAG T 

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TGGAGGTGAG A 

(2) INFORMATION FOR SEQ ID NO: 69:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

CAGTCGTGAG G 

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:
CCGAGGTGAG C

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
TGGAGGTACC A 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GGAAGGTCAG T 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:
AGCAGGTGGG C 

(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GCCAGGTACA G

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
TGCTGGTGAG T 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 5' donor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
ATACAGGGGA T

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
ATACAGGGGA T 

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CCCCAGGCGA C 

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 11 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
ACGCAGTGCA A 11 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
TTTCAGATCC A 11 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:



CCCCAGGAGG G 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(A) DESCRIPTION: 3' acceptor consensus splice sequence

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:
TCACAGGCTC A 

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
CCCTAGCTCC A 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CTCCAGTCCA G 11 
(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TCGCAGGTGA CA 



(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(A) DESCRIPTION: 3' acceptor consensus splice sequence 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:
ACACAGAAGG G 



